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Abstract 

This note makes the obvious observation that Hoeffding's original 
proof of his inequality remains valid in the game-theoretic framework. 
All details are spelled out for the convenience of future reference. 

1 Introduction 

The game-theoretic approach to probability was started by von Mises and 
greatly advanced by Ville [5]; however, it has been overshadowed by Kol- 
mogorov's measure-theoretic approach [3]. The relatively recent book [4j con- 
tains game-theoretic versions of several results of probability theory, and it 
argues that the game-theoretic versions have important advantages over the 
conventional measure-theoretic versions. However, [3] does not contain any 
large-deviation inequalities. This note fills the gap by stating the game-theoretic 
version of Hoeffding's inequality (FJ], Theorem 2). 

2 Hoeffding's supermartingale 

This section presents perhaps the most useful product of Hoeffding's method, a 
non- negative supermartingale starting from 1. This supermartingale will easily 
yield Hoeffding's inequality in the following section. 

This is a version of the basic forecasting protocol from [3] : 

Game of forecasting bounded variables 
Players: Sceptic, Forecaster, Reality 
Protocol: 

Sceptic announces /Co S K. 

FOR n — 1,2,...: 

Forecaster announces interval bn,^ ] C R and number /i„ 6 (a, b). 
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Sceptic announces M n £ R. 

Reality announces x n £ [a n ,b n ]- 

Sceptic announces IC n < K, n -i + M n (x n — fx n ). 



On each round n of the game Forecaster outputs an interval [a-m bn 

] which, in his 

opinion, will cover the actual observation x n to be chosen by Reality, and also 
outputs his expectation (i n for x n . The forecasts are being tested by Sceptic, 
who is allowed to gamble against them. The expectation \i n is interpreted as 
the price of a ticket which pays x n after Reality's move becomes known; Sceptic 
is allowed to buy any number M n , positive, zero, or negative, of such tickets. 
When x n falls outside [a n , b n ], Sceptic becomes infinitely rich; without loss of 
generality we include the requirement x n £ [a n , b n ] in the protocol; furthermore, 
we will always assume that fi n £ (a„, b n ). Sceptic is allowed to choose his initial 
capital /Co and is allowed to throw away part of his money at the end of each 
round. 

It is important that the game of forecasting bounded variables is a perfect- 
information game: each player can see the other players' moves before making 
his or her (Forecaster and Sceptic are male and Reality is female) own move; 
there is no randomness in the protocol. 

A process is a real-valued function defined on all finite sequences 
(ai, 61, fit, Xi, . . . , o/v, bjvj R/v> %n), N = 0,1,..., of Forecaster's and Real- 
ity's moves in the game of forecasting bounded variables. If we fix a strategy for 
Sceptic, Sceptic's capital ICn, N = 0, 1,..., become a function of Forecaster's 
and Reality's previous moves; in other words, Sceptic's capital becomes a pro- 
cess. The processes that can be obtained this way are called (game-theoretic) 
supermartingales. 

The following theorem is essentially inequality (4.16) in [2]. 

Theorem 1 For any h £M., the process 



is a supermartingale. 

Proof Assume, without loss of generality, that Forecaster is additionally re- 
quired to always set /i„ := 0. (Adding the same constant to a„, b n , and fi n will 
not change anything for Sceptic.) Now we have a n < < b n . 

It suffices to prove that on round n Sceptic can make a capital of K, into a 
capital of at least 





in other words, that he can obtain a payoff of at least 
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using the available tickets (paying x n and costing 0). This will follow from the 
inequality 

/ h 2 \ e hbn - e han ( h 2 
exp f hx n - -g-(&n - a n ) 2 J - 1 < x n - - exp f — — (fr n - a n ) 2 



which can be rewritten as 



h 2 , , ,,\ e hb n _ e ha 



exp (hx n ) < exp — (b n ~ a n ) + x n — . (1) 

Our goal is to prove |T]). By the convexity of the function exp, it suffices to 
prove 

?»Z*L e M* + ^ZfH e han < exp (^-{b n - a „^2^ , - ^ - 
b n - a n b n - a n \ 8 

i.e., 



b n e ha " - a n e hK (h 



b n d r , 



,2 



< exp ( ,J --(b n - a n ) 2 



i.e., 



In (b n e ha - - a n e hK ) <^(b n - a n ) 2 + ln(6„ - a n ). (2) 
o 

The derivative of the left-hand side of @ is 

„ h P ha n — n h phb„ 
b n e ha ™ - a n e hb " 

and the second derivative, after cancellations and regrouping, is 

2 (6ne to ") {-a n e hb «) 
(b n e ha n - a n e hh n) 

The last ratio is of the form u(l — u) where < u < 1. Hence it does not exceed 
1/4, and the second derivative itself does not exceed (b n — a„) 2 /4. Inequality 
J2]) now follows from the second-order Taylor expansion of the left-hand side 
around h = 0. I 

The relation between the game-theoretic and measure-theoretic approaches 
to probability is described in [3], Chapter 8. Intuitively, the generality of the 
game-theoretic protocol stems from the fact that Forecaster is not asked to pro- 
duce a full-blown probability forecast for x n : only the elements (a„, 6„, fj, n ) that 
we really need for our mathematical result enter the game of forecasting bounded 
variables. Besides, the players are allowed to react to each other moves; in par- 
ticular, Reality may react to Forecaster's moves and both Reality and Forecaster 
may react to Sceptic's moves (the latter is important in applications to defensive 
forecasting: see, e.g., [BJ. It is remarkable that many measure-theoretic proofs 
carry over in a straightforward manner to game-theoretic probability. 
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3 Hoeffding's inequality 



We start from the definition of upper probability, a game-theoretic counter- 
part (along with lower probability) of the standard measure-theoretic notion of 
probability. Suppose the game of forecasting bounded variables lasts a known 
number N of rounds. (See [I] for the general definition.) The sample space is 
the set of all sequences (a\, b±, fi\, x\, . . . , a,N, 6jv,/ijVj ^iv) of Forecaster's and 
Reality's moves in the game. An event is a subset of the sample space. The up- 
per probability of an event E is the infimum of the initial value of non-negative 
supermartingales that take value at least 1 on E. (See [I], Chapter 8, for a 
demonstration that this definition agrees with measure-theoretic probability.) 

Theorem Q] immediately gives Hoeffding's inequality (cf. [2J, the proof of 
Theorem 2) when combined with the definition of game-theoretic probability: 

Corollary 1 Suppose the game of forecasting bounded variables lasts a fixed 
number N of rounds. If all a n and b n are given in advance and t > is a 
known constant, the upper probability of the event 

I N 

^^2(Xn-^n)>t (3) 



N 

n=l 



does not exceed 



where C := Y,n=i( b n ~ a «) 2 ■ 

(The reader will see that it is sufficient for Sceptic to know only C at the start 
of the game, not the individual a n and b n .) 

Proof The supermartingale of Theorem [T] starts from f and achieves 

N ( h 2 \ ( h 2 \ 

J|exp lh(x n - fi n ) - y(&„ - a n ) 2 J > exp f hNt - yCj (4) 

n— 1 ^ / \ / 



on the event ([3]). The right-hand side of ([4]) attains its maximum at h := 4Nt/C, 
which gives the statement of the corollary. I 

Remark The measure-theoretic counterpart of Corollary Q] is sometimes re- 
ferred to as the Hoeffding-Azuma inequality, in honour of Kazuoki Azuma [1] . 
The martingale version, however, is also stated in Hoeffding's paper ([2], the 
end of Section 2). 
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